Population Genomics
   HOME

TheInfoList



OR:

Population genomics is the large-scale comparison of DNA sequences of populations. Population genomics is a
neologism A neologism Greek νέο- ''néo''(="new") and λόγος /''lógos'' meaning "speech, utterance"] is a relatively recent or isolated term, word, or phrase that may be in the process of entering common use, but that has not been fully accepted int ...
that is associated with
population genetics Population genetics is a subfield of genetics that deals with genetic differences within and between populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as adaptation, speciation, and pop ...
. Population genomics studies
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ge ...
-wide effects to improve our understanding of
microevolution Microevolution is the change in allele frequencies that occurs over time within a population. This change is due to four different processes: mutation, selection (natural and artificial), gene flow and genetic drift. This change happens over a r ...
so that we may learn the
phylogenetic In biology, phylogenetics (; from Greek φυλή/ φῦλον [] "tribe, clan, race", and wikt:γενετικός, γενετικός [] "origin, source, birth") is the study of the evolutionary history and relationships among or within groups o ...
history and
demography Demography () is the statistics, statistical study of populations, especially human beings. Demographic analysis examines and measures the dimensions and Population dynamics, dynamics of populations; it can cover whole societies or groups ...
of a population.Luikart, G.; England, P. R.; Tallmon, D.; Jordan S.; Taberlet P. (2003). "The Power and Promise of Population Genomics: From Genotyping to Genome Typing". ''Nature Reviews'' (4): 981-994


History

Population genomics has been of interest to scientists since Darwin. Some of the first methods used for studying genetic variability at multiple loci included gel electrophoresis and restriction enzyme mapping. Previously
genomics Genomics is an interdisciplinary field of biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dim ...
was restricted to only the study of a low amount of loci. However recent advancements in sequencing and computer storage and power have allowed for the study of hundreds of thousands of loci from populations. Analysis of this data requires identification of non-neutral or outlier loci that indicate selection in that region of the genome. This will allow the researcher to remove these loci to study genome wide effects or to focus on these loci if they are of interest.


Research applications

In the study of ''
Schizosaccharomyces pombe ''Schizosaccharomyces pombe'', also called "fission yeast", is a species of yeast used in traditional brewing and as a model organism in molecular and cell biology. It is a unicellular eukaryote, whose cells are rod-shaped. Cells typically measur ...
'' (more commonly known as fission yeast), a popular model organism, population genomics has been used to understand the reason for the phenotypic variation within a species. However, since the genetic variation within this species was previously poorly understood due to technological restrictions, population genomics allows us to learn about the species' genetic differences. In the human population, population genomics has been used to study the genetic change since humans began to migrate away from Africa approximately 50,000-100,000 years ago. It has been shown that not only were genes related to fertility and reproduction highly selected for, but also that the further humans moved away from Africa, the greater the presence of lactase. A 2007 study done by Begun ''et al.'' compared the whole genome sequence of multiple lines of ''
Drosophila simulans ''Drosophila simulans'' is a species of fly closely related to ''D. melanogaster'', belonging to the same ''melanogaster'' species subgroup. Its closest relatives are ''D. mauritiana'' and ''D. sechellia''. Taxonomy This species was discovere ...
'' to the assembly of ''
D. melanogaster ''Drosophila melanogaster'' is a species of fly (the taxonomic order Diptera) in the family Drosophilidae. The species is often referred to as the fruit fly or lesser fruit fly, or less commonly the " vinegar fly" or "pomace fly". Starting with ...
'' and '' D. yakuba''. This was done by aligning DNA from whole genome shotgun sequences of ''D. simulans'' to a standard reference sequence before carrying out whole genome analysis of polymorphism and divergence. This revealed a large number of proteins that had experienced
directional selection In population genetics, directional selection, is a mode of negative natural selection in which an extreme phenotype is favored over other phenotypes, causing the allele frequency to shift over time in the direction of that phenotype. Under dir ...
. They discovered previously unknown, large scale fluctuations in both polymorphism and divergence along chromosome arms. They found that the X chromosome had faster divergence and significantly less polymorphism than previously expected. They also found regions of the genome (e.g. UTRs) that signaled adaptive evolution. In 2014 Jacquot ''et al.'' studied the diversification and epidemiology of endemic bacterial pathogens by using the ''
Borrelia burgdorferi ''Borrelia burgdorferi'' is a bacterial species of the spirochete class in the genus '' Borrelia'', and is one of the causative agents of Lyme disease in humans. Along with a few similar genospecies, some of which also cause Lyme disease, it mak ...
'' species complex (the bacteria responsible for Lyme disease) as a model. They also wished to compare the genetic structure between ''B. burgdorferi'' and the closely related species '' B. garinii'' and '' B. afzelii''. They began by sequencing samples from a culture and then mapping the raw read onto reference sequences. SNP based and phylogenetic analyses were used on both intraspecific and interspecific levels. When looking at the degree of genetic isolation, they found that intraspecific recombination rate was ~50 times higher than the interspecific rate. They also found that by using most of the genome conspecific strains didn’t cluster in clades, raising questions about previous strategies used when investigating pathogen epidemiology. Moore ''et al'' conducted a study in 2014 in which a group of
Atlantic Salmon The Atlantic salmon (''Salmo salar'') is a species of ray-finned fish in the family Salmonidae. It is the third largest of the Salmonidae, behind Siberian taimen and Pacific Chinook salmon, growing up to a meter in length. Atlantic salmon are ...
populations which were previously analyzed with traditional population genetic analyses (
microsatellites A microsatellite is a tract of repetitive DNA in which certain DNA motifs (ranging in length from one to six or more base pairs) are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organism's genome. ...
, SNP-array genotyping, BayeScan (which uses the
Dirichlet-multinomial distribution In probability theory and statistics, the Dirichlet-multinomial distribution is a family of discrete multivariate probability distributions on a finite support of non-negative integers. It is also called the Dirichlet compound multinomial distribut ...
)) to place them into defined conservational units. This genomic assessment mostly agreed with previous results, but did identify more differences between regionally and genetically discrete groups, suggesting there were potentially even greater number of conservation units of salmon in those regions. These results verified the usefulness of genome-wide analysis in order to improve the accuracy of future designation of conservation units. In highly migratory marine species, traditional population genetic analyses often fail to identify population structure. In tunas, traditional markers such as short-range PCR products, microsatellites and SNP-arrays have struggled to distinguish
fish stocks Fish stocks are subpopulations of a particular species of fish, for which intrinsic parameters (growth, recruitment, mortality and fishing mortality) are traditionally regarded as the significant factors determining the stock's population dyna ...
from separate ocean basins. However, population genomic research using
RAD RAD or Rad may refer to: People * Robert Anthony Rad Dougall (born 1951), South African former racing driver * Rad Hourani, Canadian fashion designer and artist * Nickname of Leonardus Rad Kortenhorst (1886–1963), Dutch politician * Radley R ...
sequencing in
yellowfin tuna The yellowfin tuna (''Thunnus albacares'') is a species of tuna found in pelagic waters of tropical and subtropical oceans worldwide. Yellowfin is often marketed as ahi, from the Hawaiian , a name also used there for the closely related bigeye ...
and
albacore The albacore (''Thunnus alalunga''), known also as the longfin tuna, is a species of tuna of the order Perciformes. It is found in temperate and tropical waters across the globe in the epipelagic and mesopelagic zones. There are six distinct s ...
has been able to distinguish populations from different ocean basins and reveal fine-scale population structure. These studies identify putatively adaptive loci that reveal strong population structure, even though these sites represent a relatively small proportion of the overall DNA sequence data. In contrast, the majority of sequenced loci that are presumed to be selectively neutral do not reveal patterns of population differentiation, matching results for traditional DNA markers. The same pattern of putatively adaptive loci and RAD sequencing revealing population structure, compared to limited insight provided by traditional DNA markers is also observed for other marine fishes, including
striped marlin The striped marlin (''Kajikia audax'') is a species of marlin found in tropical to temperate Indo-Pacific oceans not far from the surface. It is a desirable commercial and game fish. The striped marlin is a predator that hunts during the day in t ...
and
lingcod The lingcod or ling cod (''Ophiodon elongatus''), also known as the buffalo cod or cultus cod, is a fish of the greenling family Hexagrammidae. It is the only extant member of the genus ''Ophiodon. ''A slightly larger, extinct species, '' Ophi ...
.


Mathematical models

Understanding and analyzing the vast data that comes from population genomics studies requires various mathematical models. One method of analyzing this vast data is through
QTL A quantitative trait locus (QTL) is a locus (section of DNA) that correlates with variation of a quantitative trait in the phenotype of a population of organisms. QTLs are mapped by identifying which molecular markers (such as SNPs or AFLPs) c ...
mapping. QTL mapping has been used to help find the genes that are responsible for adaptive phenotypes. To quantify the genetic diversity within a population a value known as the
fixation index The fixation index (FST) is a measure of population differentiation due to genetic structure. It is frequently estimated from genetic polymorphism data, such as single-nucleotide polymorphisms (SNP) or microsatellites. Developed as a special case ...
, or FST is used. When used with
Tajima's D Tajima's D is a population genetic test statistic created by and named after the Japanese researcher Fumio Tajima. Tajima's D is computed as the difference between two measures of genetic diversity: the mean number of pairwise differences and the ...
, FST has been used to show how selection acts upon a population. The McDonald-Kreitman test (or MK test) is also favored when looking for selection because it is not as sensitive to changes in a species' demography that would throw off other selection tests.


Future developments

Most developments within population genomics have to do with increases in the sequencing technology. For example, restriction-site associated DNA sequencing, or RADSeq is a relatively new technology that sequences at a lower complexity and delivers higher resolution at a reasonable cost. High-throughput sequencing technologies are also a rapidly growing field that allows for more information to be gathered on genomic divergence during speciation. High-throughput sequencing is also very useful for SNP detection, which plays a key role in personalized medicine. Another relatively new approach is reduced-representation library (RRL) sequencing which discovers and genotypes SNPs and also doesn't require reference genomes.


See also

*
Genomics Genomics is an interdisciplinary field of biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dim ...
*
Landscape genomics Landscape genomics is one of many strategies used to identify relationships between environmental factors and the genetic adaptation of organisms in response to these factors. Landscape genomics combines aspects of landscape ecology, population gene ...
*
Personal genomics Personal genomics or consumer genetics is the branch of genomics concerned with the sequencing, analysis and interpretation of the genome of an individual. The genotyping stage employs different techniques, including single-nucleotide polymorphi ...
*
Population groups in biomedicine Race and health refers to how being identified with a specific race influences health. Race is a complex concept that has changed across chronological eras and depends on both self-identification and social recognition. In the study of race and ...


Notes


External links


Population Genomics
from ''Nature Education''
The Simple Fool’s Guide to Population Genomics via RNA-Seq

Drosophila Population Genomics Project


References


Population genomics: a bridge from evolutionary history to genetic medicine
*
The Power and Promise of Population Genomics: From Genotyping to Genome Typing
* {{Population genetics Genomics Population genetics